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1.  Summary.   In  this  paper  we  offer  nonparametric  competitors  of  some  classical 
tests  of  multidimensional  independence  considered  by  Wilks  (1935),  Daly  (1940) 
and,  Wald  and  Brookner  (1941).   In  this  context,  the  problems  of  testing  ( i)  the 
mutual  independence  of  q  subsets  of  the  totality  of  p  variates  (p  >  q  >  2),  and 
(ii)  pair  wise  independence  of  p  variates  [cf.  Anderson  (1957)  Chapter  9]  are 
considered,  all  against  appropriate  classes  of  stochastic  dependence  alterna- 
tives.  Some  strictly  distribution  free  permutation  tests  are  developed  here, 
and  their  asymptotic  properties  are  studied  with  the  aid  of  a  theorem  on  the 
asymptotic  distribution  of  a  class  of  rank  order  statistics  to  the  case  of  more 
than  two  variates. 

2.  Introduction.   Let  X  =  (X  ,...,X  ),  a=l,  ...,n  be  n   i.i.d.r.v's  (indepen- 
dent and  identically  distributed  (vector  valued)  random  variables) having  a  p(>  2) 
variate  continuous  cdf  (cumulative  distribution  function)  F(x),  where  x  = 

(Xt,...,x  )  eR  ,  the  p-dimensional  Euclidean  space.   Let  X  be  partitioned  into 
1      p  ~a 

q(>  2)  subvectors;  that  is 

(2.1)  X  =  (X^^^,  ...,X^'J^;   a=l,...,n 
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where  X    is  of  i .  components;  j=l,  ...,q  and  E  i .  =  p.   Let  the  marginal  cdf  of  X 
be  denoted  by  F^-^^(x^^O  where  x^-^^  e  R  \  j=l>--->q-   Our  first  problem 
relates  to  the  stochastic  independence  of  the  q  subsets  of  variates.   This  may 
be  statistically  framed  by  means  of  the  null  hypothesis 

(2.2)  H^*^^   F(x)  =  IT  F^J\x^J^)   for  all  x  e  R^. 

J=l 

Let  us  denote  the  marginal  cdf  of  X.   by  Fr  .i(x),  i=l, ...,p;  and  the  marginal  joint 

cdf  of  (X.  ,  X.  )  by  Fr.  .n(x,y)  for  all  i=^  i=l,  .  .  . ,  p.   X.  ,...,X   are  said  to  be 
la'   ja      Li,  jj  la,'      pa 

totally  independent  if 

(2.3)  H^P^:   F(x)  =  |T  Fr.T(x)   for  all  x  e  R^  . 

Thus  H    is  a  particular  case  of  H  ^  when  q=p,  that  is  when,  i  =...,i  =1.   Similarly, 

X,  ,...,X   are  said  to  be  pair  wise  independent  if 
la'    '  pa 

(2.4)  H^*^   F^.  .j(x,y)'=  F^.^(x)  F^^^(y)  for  (x,  y)  e  R^  , 

and  for  all  i^^  j=l,  .  .  .,  p.   (It  may  be  noted  that  (2.3)  implies  (2.4)  but  not  the 
vice-versa) . 

In  the  particular  case  of  F(x)  being  a  multinormal  cdf,  pair  wise  independence 
implies  total  independence  and  vice-versa.   Moreover  in  this  case  uncorrelation 
implies  independence  and  vice-versa.   Also,  the  multinormal  cdf  is  completely 
specified  by  its  mean  vector  and  the  covariance  matrix.   Thus,  all  the  three 
problems  stated  above  reduce  to  certain  specific  structures  of  the  covariance 
matrix.   If  the  covariance  matrix  is  partitioned  into  q"^  submatrices  of  orders 
i.  X  i  ;  j,i=l, ...,q,  then  the  first  hypothesis  H  ^   is  equivalent  to  the 


hypothesis  that  all  the  off-diagonal   partitioned  matrices  are  the  null  matrices, 
while  the  hypotheses  H    and  H*  are  both  equivalent  to  the  hypothesis  that  the 
covariance  matrix  itself  is  a  diagonal  matrix.   Tests  for  these  hypotheses  are 
considered  in  Anderson  (1957,  Chapter  9),  and  the  reader  is  referred  to  it  for 
details  of  background  and  motivation. 

The  object  of  the  present  investigation  is  to  explore  the  possibility  of 
generalizing  the  above  problems  in  a  completely  nonparametric  set  up.   Now,  for 
arbitrary  (continuous)  multivariate  distributions  the  covariance  matrix  may  not 
exist,  and  even  if  it  exists,  it  may  not  play  the  fundamental  role  as  in  the  case 
of  the  multinormal  distributions.   For  this  reason,  we  shall  formulate  a  class 
of  association  parameters  which  are  regular  functionals  of  the  cdf  F(x)  and  which 
are  defined  for  a  much  wider  class  of  cdf's.   This  will  provide  us  with  a  suitable 
nonparametric  competitors  of  the  classical  covariance  (or  correlation)  matrix 
and  also  increase  the  scope  of  the  methods.   Secondly,  for  multinormal  cdf's 
uncorrelation  and  independence  are  equivalent,  but  the  same  is  not  true  for 
arbitrary  cdf»s.   Thus,  to  be  precise  about  the  class  of  alternatives,  we  will 
also  define  some  dependence  function  with  some  emphasis  on  association  alternatives. 
The  proposed  tests  will  be  shown  to  be  consistent  and  reasonably  efficient  for 
such  alternatives.   Finally,  for  arbitrary  cdf»s,  since  pairwise  independence 
does  not  inply  total  independence  (cf.  Geisser  and  Mantel  (1962)),  we  shall  also 
consider  the  problem  of  testing  H*,  defined  by  (2.4),  in  some  detail.   In  this 
context,  we  shall  extend  a  theorem  by  Bhuchongkul  (1964)  on  the  asymptotic  dis- 
tribution of  (association)  rank  order  statistics  to  the  multivariate  case,  and 
subsequently  use  it  to  study  the  asymptotic  properties  of  the  proposed  tests. 

3.   Formulation  of  a  class  of  association  _paramgters^.   Apart  from  the  necessity 

of  the  existence  of  the  second  order  moments,  the  estimator  of  the  usual  covariance 

matrix  are  quite  sensitive  to  outlying  observations.   Here  we  shall  consider  some 


alternative  measures  of  association  which  are  really  functionals  of  the  parent 
cdf's.   For  this,  we  consider  the  marginal  cdf's  Fr  .  t  and  Fr  .  .-i  (i^  j=l,  .  .  . ,  p) 
as  in  section  2.   Also,  let  J, .v(u),  0  <  u  <  1   (i=l,  ,..,p)  be  some  absolutely 
continuous  function  defined  on  the  open  interval  (0,1)  and  suppose  that  J...(u) 
is  normalized  in  the  following  manner: 


1  1 

(3.1)         /  J..v(u)  du  =  0   and   /  J^   (u)  du  =  1   for  i=l, 
0   ^^^  0   ^^^ 


Later  on  we  shall  impose  certain  regularity  conditions  on  J, ..(u).   Let  us 
now  consider  the  transformed  variables 


(3.2)  "^i  "  '^(i)^^[i]^''i^^       for  i=l,  ...,p 
and  denote 

(3.3)  Y=  (Y^,...,Yp) 

Y.  will  be  called  the  grade  functional  of  X.:  i=l, ...,p.   The  joint  distribution 
of  Y  can  be  obtained  from  F(x),  but  it  will  naturally  depend  upon  the  associa- 
tion pattern  of  F(x).   If  J-.x(u)  is  not  constant  for  all  0  <  u  <  1,  (i=l,  ...,p) 
it  is  easy  to  verify  that  independence  of  (X.,  X.)  implies  the  independence  of 
(Y.,  Y.)  and  vice-versa.   Further  this  property  is  preserved  under  any  monotone 
transformation  of  X.;  i=l, ...,p.   So  when  we  are  interested  in  the  problems  of 
stochastic  independence  we  may  work  with  Y  ,  ofI, ...,n,  and  suitably  choosen 
J^  . V ;  i=l, ...,p,  often  lead  to  some  nice  properties  of  such  alternative  measures. 
Let  us  now  define 

00    00 

(3.4)  e.  .  =   /  /  >J(i)<F[i](^))  J(j)(F[j](y))  dF[i^j](^,y);   i,j=l,...,P 
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It  may  be  noted  that  by  (3.1),  0..  =  1  for  all  i=l,  ...,p  and  |9..|  <  1  for  all 

i^j=l,  ...,p.   Further,  if  X.   and  X.   are  independent,  then  from  (3.1)  and  (3.4), 

we  see  that  0. .  =  0.   In  fact  G. .  is  the  product  moment  correlation  of  Y.   and 
ij  ij  la 

Y.  .   Thus,  we  define  a  nonparametric  association  matrix  by 
ja 

(3.5)  (H)=  ((9   )) 

Naturally,  the  choice  of  the  transformations  J/xj  ^~^j  - • • }V   plays  an 
important  role  in  the  procedure  to  be  considered.   We  shall  consider  a  general 
class  of  such  functionals  and  subsequently  we  shall  briefly  consider  the  problem 
of  selecting  some  particular  measures. 

Now,  usually  the  cdf«s  Fr.-i,  i=l,...,p  are  all  unknown,  and  hence  Y  is  also 
unknown.   However,  (3.4)  has  a  close  analogy  with  the  type  of  functionals  considered 
by  Von  Mises  (1947),  and  following  his  line  of  approach  we  shall  frame  suitable 
estimators  of  ^) .   With  this  end  in  view,  let  us  define 

(x) 

(3.6)  ^  f'l    ~  (number  of  X.   <  x  ,  a=l,  . .  .  ,n)/n;  i=l,  ...,p 

(3  7)    F       ^'  ^ 

n[i,  j]      =  (number  of  (X^^,  X  )  <  (x,  y),  ai=l,  .  .  .,n)/n;  i/j=l,  ...,p. 

Then  we  may  consider  J/.v(F  r.i(x))  as  a  natural  estimator  of  J/ . s (Fr  . ^ (x)) ; 

(i)^  nLiJ^  "  (x)   L ij 

i=l,  ...,p.   Further  in  actual  practice,  often  we  have  a  sequence  of  functions 

viz.  {J  ,..(u)}  which  converges  to  J,  ..(u)  for  all  0  <  u  <  1  as  n^»o,  and  i=l,  ...,p. 

Thus  for  the  sake  of  more  generality  in  presentation,  we  may  consider  the 

estimators 

(^•«>  '^n(i)<^n[i]<^»  • 

Consequently,  from  (3.4),  (3.6),  (3.7)  and  (3.8),  we  may  frame  the  esti- 
mator of  e. .  as  T   . .  where 


'    -^       -00   -00 

1    " 

n  _,  n(i)^  n[i]^  la'   n(j)^  nLjJ   ja   '   '  "^   ' 

CO-  i 

It  may  be  noted  that  T   ..  are  nonstochastic  (if  J  ,.^'s   are  so)  and  by  (3.1), 
■'  n,  11  n^i; 

they  converge  to  unity  as  n-^oo.   Our  proposed  tests  are  based  on  suitable  functions 

of  the  stochastic  matrix  T  ,  when 

<«n 

(3.10)  T  =  ((T   .  .)).  .  T 

^     '^  "n   ^^  n,  ij"i,  j=l,  ...,p 

In  the  sequel,  T  and  @  will  be  termed  respectively  the  sample  and  popu- 
lation  dispersion  matrices.   It  may  be  noted  that  many  well  known  nonparametric 
measures  of  association  belong  to  the  class  considered  above.   For  example,  if 
we  let 

(3.11)  J  ...(a/(n+l))  =  ("1^)2  {a  -  ^} ;   ofI,  ...,  n:  i=l,  ...,p 

n(i)   '  n'^-l         2 

then  T     reduces  to  Spearman's  rank  correlation  (cf.  Kendall  (1955))  while 
n,  ij 

for  J,  ..  (u)  =  n/Tz  (u-i),  0  <  u  <  1;   i=l,  .  .  .,p   6.  .  reduces  to  the  grade 
correlation  coefficient  (cf.  Hoeffding  (1948),  pp.  318).   The  measures  con- 
sidered by  Blomquist  (1950)  and  Bhuchongkul  (1964)  among  others,  also  belong  to 
this  class. 

We  have  so  far  tried  to  provide  a  heuristic  argument  for  measuring  association 
by  the  functionals  6. .'s  defined  by  (3.4).   We  may  also  justify  the  use  of  such 
functionals  if  we  adopt  Hajek's  approach  (1962).        However,  his  study 
specifically  relates  to  linear  regression  alternatives  whereas  we  are  more 
interested  in  association  alternatives,  and  apparently  his  procedure  seems  to 


have  very  little  impact  on  ours.   But,  in  spite  of  his  logic,  one  may  propose 
some  measures  of  association  which  are  particular  cases  of  the  general  class 
of  (h)  . 

Again,  in  multinormal  distributions  any  stochastic  association  is  reflected 
by  the  correlation  matrix.   In  general,  deviation  from  independence  may  not  be 
reflected  by  correlation.   Thus  to  consider  the  class  of  alternative  hypotheses, 
we  will  also  consider  some  dependence  function  (cf.  Sibuya   (1959)). 

Denote 


KX)   =  H^)/       \]     Fr.A^.] 


(3.12)  fi(x)  =  F(x)/  I  I  F^i](x.) 

(3.13)  a.j(x,y)  =  F^.^jj(x,y)/F^.-j(x.)  F[-jj(y),    i/j=l,...,p. 

Both  n  and  fj.  .  are  functions  of  the  marginal  distributions  as  well  as  the  joint 

ones.   So  we  shall  prefer  to  write  them  as  n(Fr  ,  n .....  Fr  -,)    and  n .  .  (Fr  .  n ,  Fr  .  n  ) 

L IJ    '  LpJ      ij  LiJ  L  jJ 

respectively.   Hereafter  we  shall  term  Q,   and  n.  .  as  dependence  functions.   (It 
may  be  noted  that  if  X  ,  ...,X  are  totally  independent  (or  if  X.  and  X.  are  in- 
dependent), then  n(or  0. .)  will  be  equal  to  unity  of  all  x  (or  all  x.,  x.)  and 
so,  the  divergence  from  unity  on  a  set  of  points  of  measure  non-zero  relates  to 
stochastic  dependence).   In  subsequent  sections,  we  shall  make  repeated  use  of 
this  type  of  dependence  functions. 

4.   Perrautationally  distribution-free  tests  for  H   .   It  may  be  noted  that  if 

X.  and  X.  belong  to  two  different  subsets,  then  under  H    in  (2.2),  G. .  =  0. 

Let  us  now  denote  by  J'  the  class  of  all  r-variate  continuous  cdf's  for  r=l,  ...,p; 

so  that  F(x)  e  ^  . 
P 

Let  us  define 


(4.1)     ^  °  =  {F(x):  F(x)  = 


j=l 


p(j)(^(j))^   pCj)  , 


T.    ,       j=l,  .  .  .,q] 


Thus  ^   is  a  subclass  of  all  p-variate  continuous  cdf s  for  which  the  q  specified 
subsets  of  variates  are  mutually  stochastically  independent.   Thus,  one  may  write 
equivalently 


(4.2) 


H^''^   F(x)  eT?<°> 
o       •>-    ^  p 


and  may  be  interested  in  the  set  of  alternatives  that  F(x)  e  3r  -3f  .   However, 

"      P   P 

such  a  formulation  will  give  rise  to  considerable  amount  of  mathematical  com- 
plications since  F(x)  is  otherwise  of  completely  unknown  form.   In  this  paper, 
we  shall  restrict  ourselves  to  the  class  of  alternatives  which  relate  to  the  lack 
of  independence  through  the  values  of  the  functional  matrix  HIJ  defined  in  (3.5). 
Let  us  now  partition  Qy  as 


©11      @12        ••• 

®21       ©22        •••       ®2 


®      = 


®.l      ®.2 


Vqq 


where  @ ,  .  is  of  the  order  i  x  i  ;  k,£=l,  ...,q.   Now  from  (4.2),  we  have 
(4.4)        ®kg  =  £   f^'^   lf£=l,...,q   under  H^'l^ 


We  frame  the  class  of  alternatives  as  the  set  of  all  p-variate  cdf's  for  which 
the  equality  sign  in  (4.4)  does  not  hold  for  at  least  one  pair  (k,£),  l^i=l, ...,q. 
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Now  recalling  the  definition  of  0. .  in  (3.4)  and  n. .  in  (3.13),  we  may  write 


(4.5)   e..  =   /  /CF^i,j](x,y)  -F^i](x)  F^j^(y)]  jJ,)[F^,](x)]j;.)[F^.^(y)] 


dF^.j(x)  dF,  .j(y) 


=  /  /  F^ij(x)  F^j](y)  [n.  .(F^.3(x),  F^  .j(y))-ll  jJ,)[F^  ,j  (x)  ] 


^o)^^[j]^>'>^  '^^[ij^^^  ''ii]^y^ 


(Here  J  denotes  the  first  derivative  of  J). 

In  most  of  the  cases  (as  we  shall  see  later  on)  J, .,(u)  is  monotone  in 
u,  0  <  u  <  1  for  i=l,  ...,p,  so  that  if  J2..(x,y)  is  uniformly  (in  x,  y)  greater 
than  (less  than)  or  equal  to  1  with  the  strict  inequality  on  at  least  a  set  of 
points  of  measure  nonzero,  then  9..^0.   In  many  types  of  factorial  dependence 
(cf.  Konijn  (1956),  Bhuchongkul  (1964)  for  the  bivariate  case),  it  is  easy  to 
verify  that  fi. .  is  greater  than  or  less  than  one,  depending  upon  the  coefficients 
of  the  stochastic  factors,  and  hence  such  an  alternative  may  be  quite  suitable. 
Incidentally,  we  are  not  restricting  ourselves  to  the  particular  type  of  the 
alternatives  of  linear  dependence  as  considered  by  Konijn  (1956)  and  Bhuchongkul 
(1964)  (in  the  particular  case  of  bivariate  distributions),  in  fact,  they  can  be 
shown  to  be  contained  in  the  class  of  alternatives  considered  in  this  paper. 

Now  for  the  simplicity  of  presentation,  we  shall  consider  in  detail  the 
tests  for  H  ^  when  q=2,  and  then  we  shall  indicate  briefly  how  the  theory  can 
be  extended  to  the  case  of  more  than  two  subsets.   Let  then 

(4.6)    X  =  iX^^\    X^^^;    X^i^  =  (X,  ,  •  •  • ,  X„  );    X^^^  =  (X„^,   ,...,X   ) 

«a    ~a  '  -a    '     a^      la'    '    ia.  -a       ^+l,a      pa 
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where  1  <  & ,    upp-H   <  p,    and  ar=l,  ...,n. 


We  denote  the  sample  point  E  by 


pxn  _  ,    , 


(4.7)   E^'""  =  (X*  ...,X') 
•«n      "wF     "-n 


„(!)•      „(1)'^ 


„(2)'      „(2)' 

A,     ,  .  .  .  ,   A 

f»i        '         '  ~n 


pxn  . 


The  ioint  distribution  function  of  E     is  given  by 
■^  *<n 


(4.8) 


G(E  ) 


n 

ar=l 


F(x  ) 

•»cc 


and  by  (2.2),  under  H 


(2) 


(4.9) 


G(E  )  =  fy  {F<l>(x(^> 

wn     I  I       wtt 

OFl 


Let  now  R  =  (R  ,  lU,  ...,R  )  be  any  permutation  of  (1, 2, ...,n)  and  we  denote 
by  3{  ,  the  set  of  all  the  n.»  permutations  of  (1, 2,  ...,n).  Furthermore,  let 
us  also  denote 


(4.10) 


E(R  )  = 
<v  ^n 


x;^^',...,  x<i>" 

•vl   '    '  -xn 


y(2)'     „(2)' 
1         n 


'  »vn      n 


(4.11) 


S(E  )  =  {E(R  )  :  R  e   2  } 
«wn     »«  'xn    <«n     n 


Then,  it  follows  from  (4.9)  and  (4.10)  that  the  joint  distribution  of  E(R  ) 

(2) 
remains  invariant  under  the  group  of  transformations  R  e  JJ   if  H    holds. 

mn    -n     o 
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Consequently,  we  shall  call  S(E  ),  the  permutation  invariant  set  of  E  .   Let 
-1     J'  ^n  -  i ^jj 

now  5   be  the  n  p  dimensional  Euclidean  space,  so  that  E  e  S  .   Now  under 
~n  '         --n   -'n 

(2) 
H   ,  the  probability  distribution  of  E   over  the  set  of  n?  points  in  S(E  )  is 
o     '  ^  ■'  »»n  "^  ~n 

uniform,  and  hence,  if  "^CE  )  is  any  test  function  choosen  in  such  a  way  that 


(4.12)  E     *(£*)  =  n!  e  ;    0  <  e  <  1 

E*eS(E  )   ~" 
^n   "-n 


for  all  E  e  6  ,  then 
■»n   ~n 

(4.13)  E  ,„.{4>(E  )}  =  e,   the  level  of  signification. 

^(2)   ~n 

o 
(E  ,    \{'}  denotes  the  expectation  of  {• }  under  H  ^  ). 

H(q) 

o 

Now  to  formulate  't)(E  )  in  a  suitable  manner  and  to  make  it  invariant  under 

'»n 

monotone  transformation  of  the  variables,  we  proceed  as  follows.   For  each  integer 
n,  let 


(4.14)  L^^^  =  (L(iJ,...,L(^i);    i=l,...,p. 

•«n       ^j  '-  n,  n 


be  the  p  sets  of  real  constants,  where 


(4.15)  l(;>  =  j(^>  (^);   a=l,...,n;    i=l,...,p. 

and  where  the  functions  J    have  the  same  interpretation  as  in  section  3.    With 

n 

this  notation  we  can  rewrite  T   .  .  (cf.  (3.9))  as 

n,  Lj 

(4.16)  T   ..  =  -   L  ^^\        L^J^    ;       i,  j=l,  ...,p 

n,  i  i    n     ,   n,  R.    n,  R.   '       ,^}jv 
'  a=l    '  la     ja 
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(2) 
where  R.   is  the  rank  of  X.   among  (X. ,,...,  X.  )  for  i=l,  ...,n.   Under  H   , 
la  la        il'   '  in        '        '  o     ' 

X   and  X.   are  stochastically  independent  for  all  i=l, ...,i  and  i=i+l, ...,p. 
la      ja 

Let  us  now  partition  T  =  ((T   .  .))  i,  1=1, ...,p  as  we  did  (ffl  in  (4.3)  for 
<p2.   We  then  write 


(4.17) 


A"5f» 

((T<\f)) 

T     = 

~n 

V<<\f)) 

note  that 


((T<;^5>))  =  ((T^^^.))  i,J=l,...,.;  ((T<;^f ))  =  ((T^^,.))  i=l    ,i 

J— iJ+i,  .  .  . ,  p 


(4.18) 


((T^^^.P))  =  ((T  ,.  J)  i=i+l,  ...,p  ;  ((Tp^.f ))  =  ((T  ^.  J)  i=i+l,...,p 


Let  us  now  denote  by  ^  ,  the  perrautational  probability  law  generated  by  the 
n!  equally  likely  (conditional)  realizations  of  E  over  S(E  ).  It  is  then 
easily  seen  that 

(4.19)  E(T   ..|^)  =  0   for  all  i=l,  ...,^  ;   and  j=i+l,  .  .  .,  p. 

n,  1 J    n 

(4.20)  Cov(T   .  .,  T      I  {p  )  =  -i-  T   ..   T   ... 

n,  ij'   n,  i»j»'   n    n-1   n,  11'   n,  jj» 

for  all  i,  i'=l,  ...,^   and   j,  j»=i+l,  .  .  .,  p. 

(2) 
Since  we  are  interested  in  a  comprehensive  test  for  H    (i.e.,  when  q=2), 

following  some  simple  steps  we  may  consider  the  test  statistic 

/T\  11  p      p  ..,.., 

(4.21)  V^,;.  =  (n-1)  Z    E     Z      2    T   .  .  T   .  .   T^^   T^^ 

"^^^         i=l  i.=  l  j=i5+l  j.=^+l  "'^J   "'"'J*   "    " 
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where   T^Ji)is  the  (i, i')th  term  in  ((T^  '   ))'   and  T^/2)^^  ^^^   (J,  j')th  term 

(2  2)   -1 
in  ((T  '.  .  ))   .   However,  we  shall  find  it  more  convenient  to  work  with  the 
n,  1 J 

test-statistic  S  ,„,  defined  as 
n(2) 


IIt   li 

(4  22)  S^-^^   =  "^J  

'n(2)     (1,1)     (2,2), 

"  n,  Lj  "  "  n,  ij  " 

where  {{ •  {{  is  the  determinant  of  the  matrix  ((•))•   S   is  analogous  to  the  usual 

n 

likelihood  ratio  test  (cf.  Anderson  (1957),  pp.  233)  where  instead  of  T    »s, 

n,  ij 

the  sample  product  moment  correlations  are  used.   We  shall  first  consider  the 

relation  between  S  ,„,,  and  vV.„.. 
n(2)      n(2) 

THEORY  4.  1.  If  X  has  a  nonsingular  distribution  in  the  sense  that  ^ ,  defined 

by  (3.4)  and  (3.5),  is  positive  definite,  then  IJT  ||  converges  in  probability  to 

II  @  II  as  n-^«),  provided  E|  T   .  .  |  ^   °^/   <  oo  for  some  &  >  0. 
^<  n,  11 

The  proof  of  this  theorem  follows  precisely  as  that  of  Theorem  4.2  of  Puri 
and  Sen  (1966),  and  is  therefore  omitted. 

(2) 
THEOREM  4.2.   Under  H^  ^    defined  in  (2.2), 

(4.23)  |T    I  =  0  (n"0  for  all  i=l,  ...,£;  j=£+l,  ...,p  . 
The  same  result  also  holds  under  the  permutational  law  tP   . 

— 1 V  ^ 

(2) 

Proof.   First  note  that  under  H^   ,  E(T   .  .)  =  0,  i=l,  .  .  .  ,i  ;  j=£  +  l,  ....  p. 
o   '   ^  n,  11      '     '    )     '    J  >         >  f 

1   " 
Furthermore,  since  -  E  J^...(a/n)  is  finite  for  all  i=l, ...,p;  and  since  by 

(3-1),  ~  E  "^j^Q )(°-/")  ~^1  for  all  i=l,  ...,p,  we  find  after  a  few  simple  steps  that 
a=l 

(4.24)  Var„  (T   .  .)  =  —■   ,  c  <  x  for  all  i=l,  ..  .,p;  j=£+l,  -  •  .,p. 

Hn,  ij     n-1'  '    f  fj    J  }         jr 
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Hence  using  (4.19),  (4.20)  and  (4.24)  and  Tchebyschef f »s  inequality  we  obtain 
the  desired  result. 

THEOREM  4.3.   If  I T   ..1=0  (n"2)  for  all  i=l.  ...,£;    j=^+l,  ...,p  then 
—  '  n,  1 J '     p      

(4.25)  I  n(2)  "*"  "  ^°^     n(2)'  "  °  (n"^),  provided  @  is  positive 

definite. 


Proof.   By  Laplace's  expansion  of  the  determinants  (of.  Ferrar  (1941)) 


n       2   ...  A       /I   2 
'l"-^2^' •  •  ^^f =''    V^l  ^2  ••■%/      1^1  "^2  ■ 


(4.26)        llTjl  =        E        ||T/  \l|.l|T* 


^\      2  ...  Il,\ 

where  ||T^  \  ||   is  the  determinant  consisting  of  the  rows  1,2, ... ,1, 

^  /I       2      ...   Z\ 

and  columns  q  ,q„,  ...,q  ,  and  l|T*f  \  ||  is  the  complimentary  deter- 

U    q2  •••  qJ 

minant.   By  virtue  of  Theorem  4.2,  we  have  the  following  results. 
If  only  one  of  q  ,q  , ...,q   is  different  from  1,2,...,^,  then 

(1        2      ...   ^\  1  (\       1      ...   l\  1 

\    II   =   0   (n-2);       ||T*/  \    11   =   0    (n"?), 

^1  ^2  •••  ^y  K  ^2  •••  'i^; 

and, 

if  r  of  q  ,  q„,  ...,q  are  different  from  1,2, ...,£,  then 


(4.28) 


for  r  >  1 
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Consequently,    from   (4.26),    we   obtain 

l|T      ..II   =    llT^^'.^^llllT^^^.^^II 
"    n,  ij"         "   n,  ij    ""    n,  ij    " 

£         &  P  P  (-..  ,    . 

(4.29)  +EZ  E  ET..T      .,.,||T^    •  •  .llllT^    '!  .  J 


H-  Op(n-2) 


where 


(4.30)  HT^^^.II   =   cofactor   of   T      ..      inT^^^.^^         (cf.    (4.18)) 

^  "   n,  11 »"  n,  11  n,  ij 


and 


llT^^^.  ..11  =  cofactor  of  T   ..   in  T^^'.^^    (cf.  (4.18)). 
"  n,  jj'"  n,  jj      n,  ij 

Now  by  definition 

n(l)    "  n,ii«"/  "  n,ij  » 

(4.31)    and 

ii.      f2)       ^2,2) 

T^J   =  II t''''   II  /llT     II 

n(2)    "  n,jj«"/"  n,ij  " 

Hence,  from  (4.22),  (4.29),  (4.31)  and  Theorem  4.1  we  obtain 

(..32)     -"-SS<«^  =  „|^j^^  J^^.J^^^T„^,^T„„^.T-   X^J;, 

+  Op(n-  ) 

which  proves  the  required  result. 

THEOREM  4.4.   Under  the  assumptions  of  Theorem  4.4,  the  permutation  distribution 
of  V  ,^v(or  -n  log  S  .    .)    converges  asymptotically  to  a  chisquare  distribution 
with  ^ ( p -^ )  degrees  of  freedom. 
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Proof.   By  virtue  of  (4.21)  and  Theorem  4.3  it  suffices  to  show  that  any  linear 
combination 


(4.33) 


1  ^    P 
U  =  n2  E    E    d. .  T   . . 

i=l  rl+l     ^J  "'^J 


has  asymptotically  (under  JP  )  a  normal  distribution  in  the  limit  with  mean 
zero  and  finite  variance.   That  the  mean  is  zero  and  variance  is  finite  follows 
by  using  (4.19)  and  (4.20).   To  prove  the  asymptotic  normality  of  U  ,  let  us 
denote 


(4.34) 


so  that 


&  P                   /  .K           /  .X 

Z  (R  )  =  E  E  d..  L^^;   L*--^^   ,   aFl,...,n 

a  "n    •  1  •  ,  ,1   iJ   n,  R.    n,  R.   '     '    ' 

1=1  }=Z+l                '  la     JO, 


(4.35) 


-i  " 
U  =  n  "  E   Z  (R  ) 

n         ,   a  "n 
ofI 


Now  for  any  E  e  S(E  ),  defined  in  (4. 11),  Z  (R  )  can  assume  nJ  equally  likely 
■'  -n     '•n  '  '    a  '•n  ~i  j  j 

values  obtained  by  letting  g  e  S  .  Essentially  then  by  the  same  method  of 
proof  as  in  Wald-Wolfowitz-Noether  theorem  (cf.  Fraser  (1957),  p.  23  7)  it  is 
easily  seen  that 

2kJ 


(4.36)         E^J";VfE^n(^S)r 


+  o(l) 


kJ2 


o(l) 


if  i=2k 


if  r=2k+l 


for  k=0,  1,2, 


Hence  the  theorem. 


Thus  for  large  samples,  the  permutation  test  based  on  V  ,  ,(or  -n  log  S  /^\) 
reduces  to  the  following  rule: 

(4.37)    If   V^J))  or  -n  log  S^;J>)   >>^",(p.O,a'  -J^"^  «o'^ 


<  ^  £(p-^),a^  ^''^^P'  "o 


(1) 
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where  X^    is  the  100(l-a)7o  point  of  a  chi-square  distribution  with  r  degrees 
r,a 

of  freedom. 

We  will  not  discuss  briefly  the  case  of  q(q  >  2)  subsets.   Here  we  parti- 
tion X  as  in  (2.1),  (HJ  as  in  (4.3),  and  T  accordingly.   Thus 

UT^''.'^)),  ...  ,  ((T^''.^^)) 


(4.38)  T  = 

'"n 


n,  ij    '  n,  ij 


((T^^';^), ... ,  ((T^^^.'j^) 

Uj  1 J  n,  1 J 


(k  £) 
where  ((T  '.  .    ))  is  of  the  order  i  x  i  •  k,^=l,  ...,q.   Then,  by  analogy  with 

parametric  likelihood  ratio  test  statistic,  for  the  problem  of  testing 
(4.39)  H^'^):  ®j^^^=  ^U=^   k^^=l,...,p. 

We  consider  the  test  statistic 


(J)         K  iii 

(4.40)  S^''^  =  SriJ. 


n(q) 


k=l   "  n,  ij  ' 


Proceeding  as  in  Theorem  (4.3),  we  can  show  that  -n  log  S  ,  .  is 

n(q)    p 

asymptotically  equivalent  to  a  positive  definite  quadratic  form  in  L      i,  i. 

k<g=l 
statistics  T     where  X^   and  X    (k<:g=l,  .  .  . ,  p;  a=l,  ...,n)  belong  to  two  dif- 

ferent  subsets  of  variates.  The  permutation  argument  considered  earlier  readily 
extends  to  the  present  case,  where  we  shall  have  (nj)  equally  likely  realiza- 
tions.  Further  as  a  straight  forward  generalization  of  theorem  4.4,  we  shall 

arive  at  the  conclusion  that  under  the  permutation  model,  -n  log  S  ,  .  where 

n(q) 

S  .  .  is  defined  in  (4.39)  has  asymptotically  a  chi-square  distribution  with 
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E  i   i   degrees  of  freedom.   Thus  we  have  a  test  criterion  very  similar  to 

(4.37)  with  the  only  difference  that  the  degrees  of  freedom  will  be  equal  to 
P 

k<e=i  ^   ^ 

As  a  special  case,  we  consider  q=p.  Here  i  =  . . .  =  i  =1,  and  the  prob- 
lem reduces  to  that  of  testing  the  hypothesis  of  total  independence  of  all  the 
p  variates,  (cf.  (2.3)).   Here  we  may  work  with  the  test  criterion 

(4.41)  V^;J\  =  (n-1)   E    T^  .  ./T   ..  T   .. 

n(p)         i<i=l  "' ^J   "'^^   "'J  J 

(which  is  asjmiptotically  equivalent  to  -log  S  ,  v  where  S  ,  ,  is  defined  by 

(4.40)  by  putting  q=p).   In  this  case  the  permutation  distribution  of  V  .  ^ 

coincides  with  the  unconditional  null  distribution,  and  following  the  lines 

of  theorem  4.4,  it  can  be  shown  that  under  H    defined  in  (2.3),  V  ,  .  has 
'  o  '      n(p) 

asymptotically  a  chi-square  distribution  with  p(p-l)/2  degrees  of  freedom. 

5.   Asymptotic  Normality  of  T  for  arbitrary  F(x) .   In  this  section  we  shall 

establish  the  joint  asymptotic  normality  of  the  statistics  {T   . .;  l<i<j<p) 

defined  by  (3.9)  for  arbitrary  F(x). 

We  shall  make  the  following  assumptions: 

Assumption  5.1.       J  ..v(u)  =  J...(u)  exists  for  0  <  u  <  1  and  is  not  con- 
1 n-^oo  n(i)       (i) 

stant  for  i=l,  ...,p. 


Assumption  5.2. 

,    /{  ,     {•^n(i)f^n[i]<V]  '^n(j)t^n[j]<^j)]  "  "^(D^^C  i]  <  V^ 
"■nd)  ""   ■^n(j) 

J(.)[F^.^(x.)]]dF^^^^.^(x^,x.)  =  Op(n-i) 


where 
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(5.1) 


Vi)=  ^\'-      °<^n(i)<V<l^'   i=l,...,p. 


Assumption  5.3. 


J„(j^)(l)  =  oCn"*);   i=l,  ...,p. 


Assumption  5.4.       |  J,  ..(u)|  <  K[u(l-u)]  "'^  ,    0  <  a  <  1/8 


|j^.)(u)|  ^  K[u(l-u)] 


-1 


|j'('.)(u)|  ^  K[u(l-u)] 


-2 


where  JJ . x  and  JV.s  denote  the  first  and  second  derivatives  of  J,  .^  respectively, 
(i)      (i)  (i)     ^  ^' 

K  is  some  constant,  and  i=l,  ...,p. 
Finally,  we  shall  denote 


"•2)     "„(!,:)  =    {    {  •'(i)  [^i)<-i»  •'(J)  [^j)<-j"  '  ''(i,3)<V>'j' 


(5.3)        n  a 


=   n  a 


n(i,  j)  n{(i,  j),  (i,  j)) 


.     .x->  =   Var 


E      U 


(a) 
(i,  j);£ 


3  3 


(5.4)        na    .,.     .X    ,        ni   =     S  E      Cov(U^°'\.    ^,    0^°"^    ,    ^,) 

n((i,  j),  (r,s))        ^=1^1=1  Mi,j);i'       (r,s);^» 


where 


Xa) 


(5.5)        U)    ''    s    ,   =   J,    s    [Fr    t(X     )]    J,    ,    [Fr    t(X     )] 
(r, s);i  (r)         [r]^    ra^'      (s)        [ s] ^    sa'' 


if  t=\ 


+aD    +O0 


=     /     /C^x      (V    -   ^[r](V^    ^(s)    f^[s]<-s>^    ^U)    ^^-rJ^V^ 


-cx)     -00  rCL 


'^^[r,s]^V'^s^'    ^^'=2 
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OO  00 

-00  -00       so, 

if  &=3,    r  <  s=l,  ,..,p 
and,  where 

(5.6)  F,„  (u)  =  1  if  X   <  u  and  zero  otherwise. 

^    '^  IX  sa  = 

sa 

We  shall  now  prove  the  following 

THEOREM  5.1.   Under  the  assumptions  5.1  to  5.4,  the  random  vector  with  elements 

n2[(T  ,.  .V  -  H  / .  .x),  1  <  i  <  j  <  p]  where  ]i    . .     ..  is  given  by  (5.2)  has  a 
n(i,  j)     n(i,  j)     =       =  nU,  J^ 

limiting  normal  distribution  with  mean  vector  zero,  and  covariance  matrix  E  = 

n((a  f,.  .N  /    m))  given  by  (5.3)  and  (5.4)  respectively. 
n{(i,  j),  (r,s)}   ^ ^       

Proof.   We  write 

-  ^(i)tVi]<^^^'(j)^'n[j]<^j)"  -^  ^(i)^Vi]<VJ 


•^(j)^^n[j](V^ 


and,    (by  Taylor's    theorem) 


(^•«>         ^(i)tVi]^V^  ^(j)^Vj]^^j^^  =  •^(i)t^[i](V^  •^(j)^'[j]^^j^^ 

^   [^n[i]<V    -   ^[i]<V^    •^(i)f^[i](V]    '^(j)^'[j]^^j^^ 

^f^n[j](V    -^[j]h>^^0)f^[j]^V^^(i)^^[i]^VJ 

+  i-[F   r-i(x-)    -   Ffl^^-)^^   j';.s[eF   r  .n(x.)    +   (l-e)Fp..(x.)]    J      JGF   .  .t(x   ) 

s"-    nLiJ       1  LlJ       1  (i)         nL  ij       1  LiJ       1  (j)         nLjJ       J 

+   (l-e)F^^j(Xj)] 
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+  (i-e)F^.j(x.)] 


0  <  e  <  1 


and 


(5.9)  dF  r  .  .-,(x.,x.)  =  d[F  r  .  .-,  (x.,x .)  -  F.  .  .-,  (x.,  x.)]  +  dF.  .  .,(x.,x.), 

Then,  proceeding  as  in  Bhuchongkul  (1964),  we  can  express  T   ..  (cf.  (3.9))  as 

n,  ij 

(5.10)  T   .  .  =  E  A^"-'^^  +  E  B^"^' J^ 


n, 11     ,  rn        i  rn 
'  -^    r=l         r=l 


where 


(5.11)  aJ^'J>  =   /  ;  J(,)[F[,j(x.)]  J(j,[F[j](Xj)l  dF^j     (x^,x  ) 

—00   —00 
f'\  00    00 

(5.12)  A^l'^^   =   /  /  [F^^,j(x.)  -  F[i](-)]J(j)[F[jj(x.)]j',)[F^,j(x,)]  dF^i,j](x^j) 

—00   —00 
/    •  \  00     00 

(5.13)  4^'^^=      f     /[F^^.^Cx.)  -F   (x.)]J(^^[F^^j(xp]j;.^[F^^3(x.)]  dF^^^.j(x^^ 

—00   —00 

and,  the  B  terms  are  all  o  (n  2) ;  (cf.  Bhuchongkul  cited  above). 

i         "^   (i  i) 

The  difference  n2[T   . .  -  E  A  '      ]    tends  to  zero  in  probability  as  n 

n,  ij     ,  rn 
'      r=l 

tends  to  infinity,  and  so,  by  a  well  known  theorem  [Cramer  (1954  p.  299)J,  the 

I  i  "   (i  i) 

vectors  n2[T   ...  1  <  i  <  j  <  p]  and  n2[  E  A  ' -^   ,    1<  i  <  i  <  p]  have  the  same 

'    •'  r=l 
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limiting  distributions,  if  they  have  one  at  all.   Thus,  to  prove  the  theorem,  it 
suffices  to  show  that  for  any  real  A-  •  (1  <  i  <  J  <  P)^  ^°^   ^H  zero, 

E   Z  A  .(a\}' '^     +  Kt' '^     +  KI/ '^   )   has  normal  distribution  in  the  limit.   Now 
.  ,  .  ,  11   IN       2N       3N 
1=1  j=l 

3   ..  .s 

E  A  '    can  be  expressed  as 
r=l  ''^ 

j^j^  rn      n  ^^^   (i,j);l    (i,j);2    (i-,j);3 
where  the  U, .  .,'s  are  given  by  (5.5).   Hence 

\^}  i) 

(5.15)       E        E   A.j{E   A^;'j>}  =  i    E    [E        '  A,  ,  {uj^^  ,,  . ,    +  U^^^,  .,   +  u;-\,  .,  ]] 


"  C3c=i  i=i  j=i  ^^     (^^J);i        (i^j);2        (i,j);3 


i=l    j=l      -^    r=l 

i<j  i<j 

The  right  hand  side  of  (5. 15)  is  the  average  of  n  independent  and  identically 

P   P 
distributed  random  variables,  each  having  mean  E   E  A. .  H  /.  -n  and  finite 

i=l  j=l  ^J   "(^^J^ 
i<j 

third  moments.   Tha  asymptotic  normality  follows.   Furthermore  using  (5. 14), 

it  is  easy  to  check  that  the  variance-covariance  matrix  E  =  n((a  r,.     .%  /•    m)) 
^  ~     ^^  n{(L,  j),  (r,s)j 

is  given  by  (5.3)  and  (5.4).   The  theorem  follows. 

The  following  theorem  gives  a  simple  sufficient  condition  under  which  the 
assumptions  5.1,  5.2  and  5.3  hold. 


THEOREM 


5.2.   If  J  ,  .(a/n)  (i=l,,..,p)  is  the  expected  value  of  the  ath  order 
—  n(i)   '       ^    7  IT  L 


statistic  of  a  sample  of  size  from  a  population  whose  cumulative  distribution 
is  the  inverse  function  of  J , .\,     i=l,  . . .,  p  and  if  the  assumption  5. 4  of  theorem 
5.1  is  satisfied,  then  the  assumptions  5.1,  5.2  and  5.3  are  also  satisfied. 
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The  proof  of  this  theorem  is  analogous  to  that  of  theorem  2  of  Bhuchongkul 
(1964)  and  is  therefore  omitted. 

With  the  use  of  this  theorem  it  is  easy  to  verify  that  if  J   v(a/n)  is  the 

n(i)   ' 

expected  value  of  the  ath  order  statistic  of  a  sample  of  size  n  from  (i)  the 
standard  normal  distribution,  (ii)  the  logistic  distribution,  (iii)  the  double 
exponential  distributions,  (iv)  the  exponential  distribution,  and  (v)  the  uniform 
distribution,  then  the  vector  n2[  (T  .  .    -  ]i    , .     ..),  1  <  i  <  j  <  p]  has  a  limit- 
ing  normal  distribution. 

We  shall  use  the  results  of  theorem  5.1  in  deriving  large  sample  power 
properties  of  the  statistics  associated  with  the  (i)  tests  of  independence  of 
q  >  2  sets  of  variates,  and  (ii)  tests  for  pair-wise  independence.   For  the 
simplicity  of  presentation,  we  shall  first  consider  the  problem  of  testing  in- 
dependence of  two  sets  of  variates. 

6.   Testing  Independence  of  Two  Sets  of  Variates.   Let  the  p  component  vector 
X  be  partitioned  into  two  subvectors  X    and  X    as  defined  in  (4.6). 

Let  the  population  dispersion  matrix  (g)  defined  in  (3.4)  and  (3.5)  be 
partitioned  as  we  did  the  sample  dispersion  matrix  T  defined  in  (3.10).   Then 
we  have 


No 


^   /®11  ® 

.1)  @  =(  '"  ~ 

V©21  ®22, 
te  that  (h)    is  of  order  £    xZ,     (h)    is  of  order  ^  x  m.   Then,  for  the  prob- 


(2) 

lem  of  testing  H^  ' 


(6.2,  "r^@i2=@21-S 
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we  propose  to  consider  the  statistic  S  ,„.    defined  by  (4.22).   The  test  consists 

in  reiectine  H    if  S  ,„.    exceeds  some  predetermined  number  h  .   We  shall  prove 
■"     °  o       n(2)  e 

below  that  if  H    is  true,  the  statistic  -n  log  S  ,  .  has  a  limiting  chi-square 

distribution  with  Zm   degrees  of  freedom  as  n-»oo.   This  provides  the  user  of 

this  S  ,;,v  test  with  a  large  sample  approximation  of  the  value  of  h   for  any 
n(2)  e 

0  <  e  <  1. 

We  shall  now  study  the  asymptotic  distribution  of  S  ,  v  assuming  a  sequence 

(2) 
of  alternative  hypotheses  {H   ,  ifI, 2, ...}  which  specifies  that 

(6.3)    ^ij^^[i]^^[j]^  "  ^  "^  ""'^  ^ij^^[i]'^[j]^'   i=l.  •••.-«,  J=-2  +  l.---,P 


where  D,.  .    is  defined  by  (3.13),  uj.  .  is  some  function  of  (Fr  . -,,  Fr  .^ ),  and  u).  .  3^  0 

for  all  (i,  j);  i=l,  .  .  .  ,£  ;    j=^+l,  ...,p.   It  may  be  pointed  out  that  the  sequence 

(2) 
of  the  alternative  hypotheses  {H   ,  n=l,2, ...}  defined  in  (6.3)  implies  that 

(6.4)  ®12  =  ^1  =  ""*  ®12 

where  (B,«  =  ((P-  •))  is  an  £  x  m  matrix  where  elements  are 


^'•'^    ^iJ  ^  i  1   '[i]^'^  'tj](^>  '^ij<^[i]^^[j]>'^(i)t^[i](^)]^(j)t^[j](^>] 

dF^.^(x)dF^^^(y) 

i=l,...,Z;    j=£+l,  ...,p. 

The  following  lemma  is  an  immediate  consequence  of  the  fact  that  var{T   .  .} 

n,  Lj 

tends  to  zero  as  n-^oo  for  all  F  e  '^  . 

Lemma  6.1.   Under  the  assumptions  of  theorem  5. 1, 

(a)   T   -$►  @  in  probability  as  n-^oo  for  all  F  e  ^. 
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!!•  ii» 

(b)      T    .V    -^       ^(\\      ^"   probability   as   n-»oo  for   all   F  e  ^ 


P 
i,  i'=l,  ...,i 


J.I1    tJLuuaui.1  J.  Ly    as    n-^oo    tor    ail    f    e  vf 

P 

j,  jt=i  +  l,  ...,p. 


(c)      T^^2)    "^  9(2)    ^"   probability   as   n-^  for   all  F  e  3^ 


where 


(6.6)    ej:^  and  9^^.  are  the  (i,  i»)th  and  (j,  j»)th  elements  of®"!  and  (hT^ 


(1)       l2)  \.J-,  J--^  Lu  duu  \.j,  j-yLii  eiemenLS  oi  ^m  ,  anamv^ 

respectively. 
Lemma  6.2.   If,  for  each  i  <  j  =  1, . . , , p 

(i)   the  conditions  of  theorem  5. 1  are  satisfied 

(2) 
(ii)   the  hypothesis  {H   }  defined  by  (6.3)  is  valid  then,  the  matrix  with  ele- 

i_ 
ments  n2[(T   ..  -  n    ,.     .^),    i=l,  ...,i;  j=^+l,  ...,p]  has  a  limiting  normal  dis- 

tribution  with  means  zero  and  variance-covariance  matrix  Z*  =  ((a^/.  .\  /    \i)) 
l(i,  j),  (r,s)J 

where 

(6.7)       a\l.^   =  a|(,^  .)^(,^j)3  =  1     if  i=l,...,2;    j=^+l,...,P 


(6.8) 


"t(i,j),(r,s)}  =  ^ir  ^js   i^-l>--->^'  j,s=^+l,...,p. 


(note  that  9.  .  =  1). 
11 
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The  proof  of  this  lemma  is  an  immediate  consequence  of  theorem  5.1,  (6-3)  and 

(3.1),  and  is  therefore  omitted. 

THEOREM  6.1.   Under  the  assumptions  of  lemma  6.2,  the  limiting  distribution  of 

-n  loe  S  ,;!!.  is  non  central  chi-square  with  Z   m  degrees  of  freedom  and  noncentrality 
n(2; —  ^ 


parameter 

0.  Q.  n  n 


(6.10)   A„(J)  =  E   E 


0'        ^  P      P  ,-,•.   4^, 


s(J)  =  E   E    E    E   p   p    e  .  ej^. 

n(2)    i=l  i«=l  j=i!+l  jt=i!+l   "-J   ^  J    ^^^  ^^^ 


Proof.   Under  the  assumed  conditions  n^  T   . .  is  bounded  in  probability  for  all 
n,  ij 

i}=\f...,li;    j=^+l,  ...,p:   Hence  applying  Laplace  expansion  for  the  determinant 
||T  II  and  proceeding  exactly  as  in  theorem  4.3,  we  obtain 

(J)       ^   i     p      p  ...    ..,        _j 

(6.11)  -n  log  S\:^.  =  n  E   E     E      E   T   .  .  T   .  ,  .   T     T"^;!   +  0  (n  ). 

"(2)     i=i  i.=  i  j=^+l  j.=^+l  ".^J   %^'J'   n(l)   n(2)    p 

Using   lemma  6.1,    we  notice   that   -n   log  S    ,\    is   as3rmptotically   equivalent    to 

n(2) 

V*  where 
n 

Z        H  p      p  ..,.., 

(6.12)  V*  =  n  E   E     E      E   T   ..  T   q)\.    q]1. 

i=l  i.=  l  j=i+l  j.=£+l  "'^J   "'^'J'   (^^   (2) 

where  9/,\  and  qA-^    are  defined  in  (6.6).   The  result  now  follows  by  an  application 

of  lemma  6.2. 

We  shall  consider  the  special  forms  of  the  function  A  (J)   for  suitable 

n(2) 
choices  of  the  function  J  in  section  8  where  we  consider  the  asymptotic  properties 

of  the  S  ...    tests  in  relation  to  their  parametric  competitors  based  on  the 
n(2) 

likelihood  ratio  test  [cf.  Anderson  (1947),  p.  233]. 

In  the  passing  we  may  remark  that  forthe  problem  of  testing  independence 

of  q  sets  of  variates  against  the  sequence  of  alternatives  {H  ^  ,  n=l, 2, ...} 

which  specify  that  n..   is  of  the  form  (6.3)  for  all  i,  j  belonging  to  different 

sets,  it  can  be  shown  by  the  use  of  theorem  5.1  that  -n  log  S  ,  ,  (cf.  (4.40)) 

n(q) 
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has  asymptotically  a  non-central  chi-square  distribution  with   E    i,  i  degrees 

k<e=i  ^   ^ 

of  freedom.   The  details  are  omitted  because  of  the  intended  brevity  of  the 
presentation. 

In  a  relatively  simple  case,  when  cpp,  the  statistic  (4.41)  has  under  the 
sequence  of  alternatives  (6.3),  asymptotically,  a  non-central  chi-square  dis- 
tribution with  p(p-l)/2  degrees  of  freedom  and  non-centrality  parameter  A  (J) 

n(p) 
given  by 

where  p. .  is  defined  in  (6,5), 

7.   Pair  wise  Independence.   In  view  of  the  fact  that  the  pair  wise  independence 
does  not  in  general  imply  total  independence,  we  shall  consider  this  problem 
seperately  in  this  section.   To  be  precise  we  consider  the  problem  of  testing 

(7.1)  H*:   F|-^  j^(x.,Xj)  =  F^.j(x^)  •  ^[jjCXj)  for  all  pairs  (i,j) 
against  the  sequence  of  alternatives  {H*,  n=l, 2...}  which  specifies  that 

(7.2)  F.  .(x.,x.)  =  F.(x.)  .  F.(x.)  {1  +  n"2  u).  .(F.,F.)} 

iJ   1   J     11      J   J  ij   1   J 

where  oj..(F.,F.)  is  not  identically  equal  to  zero  (a.e.),  for  at  least  one  pair 
(i, j);  1  <  i  <  j  <  p.  The  test-statistic  proposed  for  this  problem  is  based  on 
the  statistic 


(7.3)  £    ,^.   =   n   T*  r'^   T*' 

n(J)     ^n       ~n 


where 


(7.4)  T*  =  (T   .„,  T   ,-,...,T   ,   ,   .) 

-n     n,  12'   n,  13'    '    n,  (p-1,  p) 
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and   r    is  a  consistent  estimator  of  T  which  is  the  inverse  of  the  co- 


variance  matrix  of  n-?  T*. 

n 

THEOI^EM  7.  1.   If,  for  each  i  <  j  =  1,  ...,p 

(i)   the  conditions  of  theorem  5. 1  are  satisfied 

( i i )   the  hypothesis  H*  defined  by  (7.2)  is  valid 

then,  the  statistic  oS  /  -n  defined  by  (7.3)  has  asymptotically  a  non-central  chi 

square  distribution  with  p(p-l)/2  degrees  of  freedom,  and  non  centrality  parameter 

A      defined  as 
^n(J) 


(7.5) 


A  -   =  A  r"^  A 

<(J)    '^ 


where 


(7.6) 


A=  0,2.  Pl3---Pp-l,p) 


(The  |3..»s  are  defined  in  (6.5)). 

Proof.  The  proof  of  this  theorem  is  an  immediate  consequence  of  the  fact  that 

under  the  given  assumptions,  the  random  vector  n/ n  T*  defined  by  (7.4)  has  a 

limiting  normal  distribution  with  mean  vector  A  given  by  (7.6)  and  the  co- 

variance  matrix  ^  =    i(.i  r , .     .^    ,        m))   where 

{(i,  j),(r,  s)} 


(7.7)   T 


{(i,j),(r,s)} 


< 


1,   if  i=r,  j=s 

^o[-^(i)(^i(V)'^(j)(^j<-j»'^(r)(^r)(V)-^(s)(^s(V] 

if  i/r,  j/s 

VJ(i)(^(^))^(j)(^j(-j)>^(s)(^s(^s»^  ''  '=''  j^^ 
V"^(i)(^(^^^"^(j)(^j(^j>>-^(r)^^r^V^]  ^f  ^^^'  j=^ 
V'^(i)(^i(^))'^(j)(^j(^j)>-^(r)(^r^V^]  i^  ^=='  j/^- 


■^-1  -1 

and,  the  fact  that  V        is  a  consistent  estimator  of  F 
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From  theorem  7.1,  it  is  clear  that  any  consistent  estimator  of  I~  will 

preserve  the  asymptotic  distribution  of  the  statistiCoZf  ,  ..   A  natural  con- 

n(J) 

sistent  estimator  of  (7.7)  may  be  obtained  readily  by  replacing  J    (F.(x  )) 
^y   '^n(i)^^n(i)^^ia^^  ^°^   ^^^    ^'    ^^^    ^^^   expectation  by  sample  averages. 

8.   Asymptotic  Relative  Efficiency.   In  this  section  we  shall  make  large  sample 

power  comparisons  between  the  S  .    .    tests,  ^^  '^  tests,  and  their  parametric 

competitors  based  on  the  likelihood  ratio  test.   First  we  consider  the  interesting 

particular  cases  of  the  S  ,„,,  and  ^v,      tests. 

n(2)       n 

(a)   Special  cases  of  the  S  ,„,  tests. 
-^ n(2)  

a(i)   Let  J,  .(u)  =  $   (u)  where  4  is  the  standard  normal  distribution  function. 

The  -n  log  S  .    .    test  then  reduces  to  the  normal  scores  -n  log  S  ';:.  test.   The 
^\^)  n(2) 

non-centrality  parameter  in  this  case  is  A  ($)   where 

n(2) 

Z  £  p  p  .... 

(8.1)      A  ($)  =  E   s   z    s   P..  P.,.,  q)]1  ej^' 

^n(2)    i=l  i.=  l  j=i+l  j.=i!+l   ^J   ^'J'   (1)   (2) 
where 


00    00 

(8.2)   p   =   /  /  Fr..(x)Fp  .-,(y)u,..(F.  ,,F.  .O  ^-i •  r—^ 

"'        -  -  ^'^     ^'^  ''      ^'^      ^'^      *[<I-\f^.^(x)    0[r^F^  .^(x 

dF^.^(x)  dFfjj(y)  (cf.  (6.5)) 


and 

(8.3)    e^^  ,  e-"-"    are  the  (i,  i»)th  and  (j,  j')th  terms  of 

®\.l  ~    ^^^ii^\^2,      ^"^  ®22  ~  ^^®'  '^^     respectively,  and  where 


00    00 

-Ir^    r  M  .-Ir 


(8.4)    e   =   /  /C   [F.  .(x)]  <t  ^[F.  .(y)]  dF.    .(x,y), 

-"     -00  -oo        ^      ■'  LJ-'  L-L>JJ 

In  an  important  case,  when  F(x)  is  N(ii,E) 


where 
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(8.5) 


K    (n)v^ 


or  i=^+l,  ...,p;  j=l, 


.P 


and  n. .  =  o. .  otherwise. 


We  have 


(8.6)    ...(F^,pF^.])=  ^ 


t.  .(x,y,p^"^) 
*i(x)  *.(y) 


where  0.  .(x.y, o.  .  )  is  the  normal  cdf  of  (x.,x.)  where  X.  is  N(0,  1),  X.  is  N(0, 1) 

and  o.  .   is  the  correlation  coefficient  between  X.  and  X..   Expanding  (J),  .(x,  y,  p.  .  ) 
ij  1      J  ij   -^  '  ij 

(n)  -i 

about  p. .  =  0,  and  neglecting  the  terms  of  order  o(n  2),  we  obtain  (also  see 

Sibuya  (1959)) 


(8.7) 


*.(x)  <^  (y) 
'ij^^[i]'^[j]''  "  ^ij  $.(x)  $.(y) 


where  ^.    is  the  density  of  $.  ;  i=l,  ...,p.   Hence,  when  F(x)  is  normal  N(p,E) 


(8.8) 


P^j  =  A^j;  i=l,...,i;  j=£+l,...,p. 


(8.9) 


^^j  =  Pij'   i=l?--->^;  j=l,...,^;  or  i=i+l,...,p;  j=^+l,...,p. 


Hence  from  (8.1),  in  case  F(x)  is  normal 


(8.10)  A^($)  =     E 


ii'       Jj* 


M  =     L  L  L  E      A    .   A  p.    .      p;!^. 

n(2)      i=l    i.=  l    j=i+l    j.=£+l    ^J      ^    J        ^^^         ^^^ 


where 
(8.11) 


p.    V    and    p/„N    are    the    (i, i»)th    and    (j, j')th    terms    of 

fn=  ((Pij))"'    ;    ?22=  ((Pij»'' 

1,  j=l,  ...,^  i,  j=^+l,  ..  .,p 
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(J) 


ank 


a(Li)   Let  J...(u)  =-712  (u-',),  then  the  -n  log  S  ,_>  test  reduces  to  the  r 
*       K^)  "  n<.2; 

(R) 

sum  statistic  -n  log  S  .  .  which  is  a  multivariate  analogue  of  the  Spearman's 

rank  correlation  statistic  [cf.  Kendall  (1955)].   The  non-centrality  parameter 

in  this  case  reduces  to  A  (R)  ,  where 

^(2) 

H  Z  P  P  ..,.., 

(8.12)   A(R)  =  E    E    E      E   P   P     eJJ  eJJ 
n(2)    i=l  i'=l  j=^+l  j«=£+l  ^J   ^  J    ^^^   ^^-' 

where 

(8.13)  p..  =  12  _/TF[i](-)^[j](y)  -ij(^[i]'^[j])  '^F[i](-)  dF[j](y) 

and 

®(1V  ®(2)  ^^^  '^^  (i,i')th  and  (j,j')th  terms  of  I^J  =  ((9..)).  .^^     ^  and 
(hT  =  ((0  )).  .  ..,      ,  respectively  and,  where 

^^  J.  J     1,  J—Jii'Ti.f  .  ,  .  f  p 

00  00 

(8.14)      0   =12   /  /  (F  (x)  -  i)(F  (y)  -  i)  dF.    .(x,y) 

-J  -00-00  J  1-ljJJ 

In  case  F(x)  is  N(jx,E)  where  E  =  ((p..  )),  p'.  =  p..  for  i,  j=l,...,£  or  i,  j=f +1,  .  . . ,  p 

and  p^   =- —  A^.   for  i=£, ...,£;    j=f+l,...,p   or   i=i+l,...,p;  }=!,.. .,Z 

V  n 

the  non-centrality  parameter  A  (R)   reduces  to 

n(2) 


^    ^     p      p  ii«   jj' 

^(2)   ^   i=l  i.=  l  j=^+l  j.=«  +  l   ^J   ^'J'   <1)   (2) 


(8.15)   A^(R)   =  -I   S    E    E 


where 


ii»      jj« 
(8.16)      po  .  and  po  ,  are  the  (i, i»)th  and  (j, j')th  terms  of 
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p  *=  (C^  Sin   -y^))  and  p^^     -    (C^Sm   -j-^)) 

i,  j=l,  ...,^  i,  j=^+l,  ...,p. 


respectively. 

(b)   Special  Cases  of<^    tests. 

8(b. 1)   Let  Jx.v(u)  =  $   (u).   Then  the  oC  (J)  reduces  to  the  normal  scores 

oC    (0)  test.   The  non-centrality  parameter  is  then  (cf.  (7.5)) 
n 

(8.17)  A.  (0)  =  A  r"-^  a' 

n 


where 


A=  (^\r\y->\.i^ph    and 


^'^^  ^^"{(i,j),(r,s)}^>  ^^  given  by  (7.7). 

In  a  special  case  when  F(x)  is  N(ii,E)  where  E  =  ((p.  .  )),  pairwise  indepen- 
dence  is  equivalent  to  total  independence  and,  hence  from  (8.7), 

n 

(8b.  2)   Let  J,  .(u)  =  sfl2(,u-^),    then  the  ^  ( J)  test  becomes  the  rank  sum^^  (R) 
(i)  2  •'  n  n 

test.   The  non-centrality  parameter,  for  the  case  when  F(x)  is  N(}i,E),  is 

(8-19)  V^)=^^^' 

(c)   Parametric  Theory.   In  the  parametric  theory  a  commonly  used  test  for  the 

(2) 
hypothesis  H    is  based  on  the  statistic  [see  Anderson  (1957),  p.  242)] 


(8.20)  V^v=  |A|/|A^J-|A22 


where 
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n  i\l        ^12 

(8.21)  A  =  E  (x^-x).(x^-£)  = 

(8.22)  A..=  E  (x^^^-x<^>).(x^j^-x^j^);    i=l,2 

(8.23)  x^^^  =  L      x^^Vn 

a=l 

It  is  easy  to  see,  by  an  application  of  a  theorem  due  to  Wald  (1943)  that 

(2) 
under  the  sequence  of  alternatives  of  the  type  H   ,  -n  log  V*  has  asymptotically 

a  non-central  chi-square  distribution  with  £m  degrees  of  freedom  and  non-centrality 

parameter  Z\^^  given  by 
n 

&        £  P      P  ..,.., 

(8.24)  ^    =     Z       L  Z  E  A   A     p\\^   p^.^ 

n   i=l  i'=l  j=^+l  j«=f+l  '•-'   "■  J    ^"^^      ^^' 

where  P/-|\  and  P/«>.  are  defined  in  (8.11). 

Furthermore,  as  mentioned  earlier,  since  under  the  normal  theory  model, 

pair  wise  independence  is  equivalent  to  total  independence,  therefore  the  tests 

for  H    and  H*  are  based  on  the  same  statistic,  viz. 
o        o  ' 


(8.25)  U*  =   '^ 


1-1   " 


wh 


ere  A  =  ((a.  .)).  .  ,       is  defined  in  (8.21),  and 
^^  LJ   1,  j=l,  ...,p  " 

n      _       _  n 

(8.26)        a.  .  =  E  (x.  -x.)(x.  -x.);   x.  =  Ex.  /n. 

It  turns  out,  (again  by  an  application  of  Wald»s  theorem  (1943))  that  -n  log  U* 
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has  under  the  sequence  of  alternatives  (7.2)  (which  in  the  case  of  normal  theory 

model  imply  p ; .  =  — A.,  for  all  i, j=l, ...,p)  asymptotically  a  non-central  chi- 

■'"-'    N/n 
square  distribution  with  p(p-l)/2  degrees  of  freedom  and  non-centrality  parameter 

(8.27)  V  =  ^  ^' 

where  A  is  defined  in  (8.17). 

Now  it  is  well  known  [cf.  Puri  and  Sen  (1966)]  that  if  the  two  statistics 
have,  under  the  alternative  hypothesis,  non-central  chi-square  distributions  with 
the  same  number  of  degrees  of  freedom,  then  the  asymptotic  relative  efficiency 
of  one  test  with  respect  to  the  other  test  is  equal  to  the  ratio  of  their  non- 
centrality  parameters  after  the  alternatives  have  been  set  equal. 

Hence,  denoting  er  rr^if    as  the  asymptotic  efficiency  of  a  test  T  relative 
to  T*,  we  have 


n(2)'         n(2) 

where  A  (O)   and  A^^   are  defined  in  (8.1)  and  (8.29),  respectively. 
n(2)      ^ 

(8.2,)  ej3(K)^^^,  -  VR)^/V 

where  A  (R)  ,  and  A*  are  defined  in  (8.12)  and  (8.29),  respectively. 

"^'^  (u) 

In  particular,  where  F(x)  is  N(}i,  ((p..  ))),  it  is  easy  to  notice  from  (8.10), 

(8.15)  and  (8.29)  that 

(8.30)  e^  (0)      =  1 

^  n(2)'^^ 
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^  £     P      P  ,i,   jj. 

T-  Z     E      E   A .  .  A .  ,  . ,  p*   p* 

/o  onx         ^  /R>,      -  _9_  i=l  i'=l  .1=i+l  i'=^+l   ^^   ^  J 

(8.31)         ^[S^^J  'V*]  -7r2  1  i     p  p — 

"^'^  E  E     E      E   A..  A.,.,  p^^'  pJJ' 

i=l  i'=l  j=^+l  j'=£+l  ^J   ^  J 


lit    jj»    iii       lit 
where  P^iy  P/o)'  Pci)  ^""^  '^O')  ^^^  defined  in  (8.11)  and  (8.16)  respectively. 

From  (8.30)  and  (8.31)  we  note  that  whereas  in  the  case  of  normal  distribu- 
tions, the  property  of  the  bivariate  normal  scores  test  of  independence  (cf.  * 
Bhuchongkul  (1964))  relative  to  the  likelihood  ratio  test  is  preserved  in  the 

multivariate  case  the  same  is  not  true  in  the  case  of  multivariate  rank  sum 

(R) 
S  ,  ,.  test.   However,  for  special  cases  of  the  type  dealt  in  Sen  and  Puri  (1966) 

one  may  consider  finding  the  bounds  of  (8.31).   We  do  not  pursue  this  problem  in 

this  paper. 

Finally,  it  is  easy  to  notice  that  under  the  normal  theory  model,  in  the 

case  of  pair  wise  independence  (or  total  independence), 

(8.32)  e  ($),  U*  =  1 

(8.33)  e  (R),  U*=  -I 
*n 
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